NORM CONVERGENCE OF NILPOTENT ERGODIC AVERAGES ALONG F0LNER 

NETS 



PAVEL ZORIN-KRANICH 



=3 



Abstract. Multiple ergodic averages corresponding to polynomial sequences in nilpo- 
tent groups of measure preserving transformations converge in L^, as was proved re- 
cently by Walsh. We extend this result to averages along Folner nets in amenable 
groups. We also generalize the double ergodic theorem for commuting actions of an 
amenable group due to Bergelson, McCutcheon and Zhang to averages with arbitrarily 
many terms. 



1. Introduction 

Let r be a locally compact amenable group with a left Haar measure | ■ |, X be a 
probability space and G be a nilpotent group of unitary operators on L^(X) that act as 
isometric algebra homomorphisms on L°°(X) (thus the group G comes from a group 
of measure-preserving transformations). 

The derivative of a mapping g : F ^ G with respect to a, b e F is the mapping 
bSi^^) — gin.)~^ g{anb). A mapping g: F ^ G is called polynomial (of scalar degree 
< d) if for any Oi, . . . , a^+i, b^,..., b^-n e F the mapping D^^^t,, ■ ■ ■ Da,^„b,^,g equals 1^ 
identically. 

Theorem 1.1. Let g^, . . . , gj : F —> G be measurable polynomial mappings and fo,... ,fj e 
L°°(X) be arbitrary bounded functions. Then for every F0lner net (fa)aeA T and any 
choice of{a^)^f^ c F the limit 

1 f 

(1-2) 1™—— /ogi(rn)/i g/m)/j. 

a \a F \ 

exists in L^(X) and is independent of the F0lner net (fa)aeA "^""^ shifts (aa)agA- 



The study of nilpotent multiple ergodic averages (I1.2D is motivated by Leibman's 
generalization [Lei98] of the Furstenberg multiple recurrence theorem [|Fur77|] . Our 
proof closely follows the groundbreaking argument of Walsh [Wall 2] who considered 
the standard F0lner sequence in Z without shifts (a^) (his argument also works for the 
standard F0lner sequence in Z' ). In order to prove our result we have to address two 
additional issues not arising in the discrete setting F = Z*". 

The first is that the family of sets {aF^b}^ f,^Y-,a€A need not be directed by inclusion. 
However, it is directed by approximate (up to a small proportion) inclusion. This turns 
out to be sufficient for our purposes and gives uniform convergence over two-sided 
shifts aF^b as a byproduct. 

The second issue is that in general a function from a directed set to itself, unlike a 
sequence of natural numbers, cannot be majorized by a monotone function. Thus we 
have to avoid to pass to monotone functions. 

Our main result (Theorem |9.1D also implies the following generalization of the dou- 
ble ergodic theorem for commuting actions of an amenable group due to Bergelson, 
McCutcheon and Zhang [BMZ97, Theorem 4.8]. 
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Theorem 1.3. Let Ti,...,Tj be measure-preserving actions of F on X that commute 
pairwise in the sense that 

T;(m)T;t(n) = Tj.(n)T;(m) whenever m, neP, i/fc 

and /o, . . . ,/j e L°°{X) be arbitrary bounded functions. Then for every F0lner net [Fa^aeA 
in r the limit 

c 

/o(-^)/i(Ti(m)x) /;(Ti(m) . . .Tj{m)x) 



(1.4) lim-i 



meF, 

2 



exists in L (X) and is independent of the F0lner net {F^)^^. 

The remaining part of this article is organized as follows. We begin with a very 
brief overview of other methods by which our problem has been studied in Section [2j 
A close look at the von Neumann mean ergodic theorem in Section |3] motivates the 
Walsh structure theorem. We continue with algebraic preliminaries in Section |4] and 
a streamlined treatment of Walsh's complexity in Section [5] The properties of F0lner 
nets that are important for us are reviewed in Section [S] Sections [7] and [8] contain 
slight variations of the abstract structure theorem due to Gowers and Walsh and the 
inverse theorem due to Walsh, respectively. Finally we prove the main result in Sec- 
tion [91 In Appendix |A| we prove a right polynomial version of Theorem ll.il as well as 
Theorem [LSI 

Our argument is presented for F0lner sequences but applies directly to F0lner nets 
upon replacing the natural numbers by the directed set A throughout. Without loss of 
generality we work with real- valued functions onX. 

Acknowledgment. The author thanks Vitaly Bergelson for bringing the problem of 
convergence of the averages ( I1.2D to his attention. 



2. Convergence of multiple averages by other methods 

Before Walsh's proof there existed basically two approaches to norm convergence of 
averages of the form (II. 2D : via characteristic factors and via analogs of the h5^ergraph 
regularity lemma. We do not attempt to give a full account of the history (an extensive 
bibliography on this and many related topics can be found in Frantzikinakis's survey 
IIFrallin and only point out some developmets that occured since 2005. 

(Partial) characteristic factors. Characteristic factors have been explicitely introduced 
by Furstenberg and Weiss [|FW96|] . The general idea is to take expectations onto some 
factors with desirable algebraic properties, show that this does not change the limiting 
behavior of the averages and analyze the averages on the factors. 

The single most important kind of factors in this area are maximal nilfactors. They 
have been constructed independently by Host and Kra LHK0 5 ] and Ziegler [,Zie07i1 us- 
ing some earlier ideas of Conze and Lesigne. The two constructions lead to identical 
factor^ but the approach of Host and Kra has the advantage that the factors are char- 
acterized in terms of uniformity seminorms (introduced in a special case by Bergelson 
mirOl). 

By PET induction f Ber87ll maximal nilfactors are characteristic for arbitrary polyno- 
mial averages involving one measure-preserving transformation IILeiOSII . Convergence 
of the averages follows from Leibman's pointwise multiple ergodic theorem for nilsys- 
tems [LeiOSb ]. This information also allowed to characterize families of polynomials 
for which a multiple recurrence theorem holds [,BLL08J . 



^In Ziegler's terminology fc-universal characteristic factors i^CX) are the maximal {k — l)-step pro-nil- 
factors |Zie07| Theorem 1.8] and the minimal fc-characterstic factors, so they coincide with Host-Kra factors 
since the latter are pro-nilsystems and fc-characteristic. 
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Commuting transformations. For commuting transformations one cannot in general ex- 
pect nilfactors to be partially characteristic: this already fails for the simplest averages 
II^neF„ ^"/i ^2 /2 ^^th Tj = Tj. But, under fairly mild assumptions that prevent such 
degeneracies, nilfactors do in fact happen to be partially characteristic. 

The first possibility is to impose additional ergodicity assumptions on the commut- 
ing transformations T,. Assume for instance that each and T^T^T^, i ^ i' , is ergodic. 
Then the nilfactors associated to the different T^'s coincide and are characteristic for 
linear multiple averages involving the r,'s IIFK05II . Moreover, if (T^, . . . , T;) is a "totally 
ergodic generating set", meaning that T^' . . . T'' is ergodic unless = • • • = c; = 0, 
then the nilfactors are also characteristic for polynomial averages [Joh09,] . 

The second possibility is to consider distinct degree polynomial iterates of general 
commuting transformations T; (without any ergodicity assumptions). In this case the 
nilfactors associated to the T^'s are partially characteristic [.CFHll.l l^ 

Continuous time. Characteristic nilfactors for R-actions have been constructed by Potts 
[|Pot09|] who used them to prove norm convergence of polynomial multiple averages 
with continuous time. This problem can also be reduced to the corresponding problem 
for Z-actions IIBLMllll . 

Distal characteristic factors. Not every characteristic factor is created nilpotent. In 
fact, convergence of certain two-term averages corresponding to commuting actions 
of amenable groups [BMZ97] and Z-actions that generate a nilpotent group IIBL02II 
has been established using partial characteristic factors that are compact extensions of 
Kronecker factors without any appeal to a nilpotent structure!! 

Hypergraph regularity. Another approach to convergence of multiple ergodic aver- 
ages utilizes ideas surrounding the (infinitary) hypergraph regularity lemma. It has 
been pioneered by Tao in his proof of convergence of linear multiple averages corre- 
sponding to arbitrary commuting transformations IITaoOSII . His proof was unconven- 
tional in two more aspects: the result was deduced from a finitary statement using 
a Furstenberg correspondence principle backwards and that finitary statement was a 
manifestation of the no-counterexample interpretation of convergence (Towsner later 
gave a non-standard proof l|Tow09ll that uses some of Tao's ideas) . 

Austin introduced the important device of "pleasant extensions" IIAuslOII that in 
turn admit nicer characteristic factors than the original system does. This inspired 
Host to devise "magic extensions" that have similar properties ||Hos09ll . 

Although these methods deal with several commuting transformations in a very nat- 
ural way, they do not seem to extend to the polynomial setting easily, basically because 
the Furstenberg self-joining lacks obvious invariance properties. Some progress in this 
direction includes Austin's proof that the averages j|Xn=i T"^fT"^S"g converge pro- 
vided only that that T and S commute [Aus09..Aus09bl1 . The situation is better in 
the continuous time setting, where the Furstenberg self-joining is invariant under the 
off-diagonal flow [lAusllbl Theorem 3.4(C)], IIAusllI Theorem 1.1] Q 



For ergodic Tj's appeals to the theory of uniformity seminorms on in that article could be replaced 
by "vertical Fourier analysis". 

^The latter paper contains an example of operators T and S that generate a solvable group of class 2 
such that the averages jj 2]n=i T^fS^S do not converge. 

^e remark that our Theorem I 1 . 1 [ provides convergence of the averages in IIAusllI Theorem 1.2] on the 
joining (and not only of their expectations on the first factor) but fails to produce the invariance. 



4 



PAVEL ZORIN-KRANICH 



3. A CLOSE LOOK AT THE VON NEUMANN MEAN ERGODIC THEOREM 



In this section we will illustrate Kreisel's no-counterexample interpretation of con- 
vergence with a (known) quantitative version of the von Neumann mean ergodic the- 
orem for multiplicators on the unit circle T. This tangible setting is used to explain 
several ideas that occur in the proof of Theorem ll.il 

Throughout this section /x denotes a Borel measure on T and Uf{X) = A/ (A) is a 
multiplicator on L^(T, fx). 

Multiplicator von Neumann Theorem 3.1. Let fi and U be as above. Then the ergodic 
averages = 5^Xii<„<N U"f converge in L^ifx). 

Proof. The averages a^j are dominated by |/| and converge pointwise, namely to / (I) 
at 1 and to zero elsewhere. □ 

It is well-known that no uniform bound on the rate of convergence of the ergodic 
averages can be given even if U is similar to the Koopman operator of a measure- 
preserving transformation ||Kre78ll . However there does exist a uniform bound on the 
rate of metastability of the ergodic averages. Let us recall the concept of metastability. 
The sequence (a^) converges if and only if it is Cauchy i.e. 



we see that this is equivalent to 

3e > 03F: N^NVM3JV,JV': M < AT, JV' < F(M), ||a^ -a^/||2 > e. 

Negating this we obtain that (a^) is Cauchy if and only if 

Ve>0VF:N^N3MVJV,JV'(M<JV,JV'<f(M) =^ ||a^ - ajv'llz < e). 

This kind of condition, namely that the oscillation of a function is small on a finite 
interval is called metastability. A bound on the rate of metastability is a bound on M 
that may depend on e and F but not the sequence (a^)Ar. 

The appropriate reformulation of the von Neumann mean ergodic theorem for the 
operator U in terms of metastability reads as follows. 

Multiplicator von Neumann Theorem 3.2 (finitary version). Let fj, and U be as above. 
Then for every e > 0, every function F : N ^ N and every f e L^[^jl) there exists a number 
M such that for every M <N,N' < F(M) we have 



Although Theorem l3.2l is equivalent to Theorem l3.1l bv the above considerations we 
now attempt to prove it as stated. 

Proof of Theorem \3^ It clearly suffices to consider strictly monotonically increasing 
functions F. Let us assume II/II2 = 1, take an arbitrary M and see what can be said 
about the averages in ( 13. 3D . 

Suppose first that / is supported near 1, say on the disc Aj\f with radius -^^^ and 
center 1. Then U"f is independent of n up to a relative error of | provided that 
n < F(M), hence both averages are nearly equal. 

Suppose now that the support of / is bounded away from 1, say / is supported 
on the complement Bjy, of the disc with radius and center 1. Then the exponential 



Ve > 03MVAr,JV'(M < JV,JV' =^ ||ajv - ajv'llz < e). 
The negation of this statement, i.e. "(cijv) is not Cauchy" reads 

3e > 0VM3JV,N': M <N,N', ||ajv -a^'Hz > e. 
Choosing witnesses N(M), JV'(M) for each M and defining 

F(M) = max{JV(M),JV'(M)} 



(3.3) 
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sums ^ Xii<n<N ^" bounded by | for all A in the support of / provided that N >M, 
hence both averages are small. 

However, there is an annulus whose intersection with the unit circle E^j = T\ (A^ U 
Bjvf) does not fall in any of the two cases. The key insight is that the regions E^. can 
be made pairwise disjoint if one chooses a sufficiently rapidly growing sequence (M,);, 
for instance it suffices to ensure < , . 

Given / with II/II2 < 1 we can by the pigeonhole principle find an i such that 
ll/^Mj II2 < (here we identify sets with their characteristic functions). Thus we can 
split 

(3.4) f=cr+u + v, 

where a = fA^, is "structured", u = fB^, is "pseudorandom" and v = fE^, is L^- 
small. By the above considerations we obtain 03.30 for all M, < JV,JV' < F(M;). □ 

Observe that the sequence (MJ; in the foregoing proof does not depend on the 
measure fi. Moreover a finite number of disjoint regions E^, suffices to ensure that 
fE^, is small for some i. This yields the following strengthening of the von Neumann 
theorem. 

Multiplicator von Neumann Theorem 3.5 (quantitative version). For every e > and 
every function F : N ^ N there exist natural numbers M^, . . . ,Mg; such that for every /i, 
and every f e with II/II2 < 1 there exists an i such that for every M,- < N,N' < 

F(Mi) we have 

I 1 1 

- > [/7-— y u"f <e, 

JV ^ iV' 2 

l<n<Ar l<n<N' 

where Uf{X) = A/ (A) is a multiplicator as above. 

The spectral theorem or the Herglotz-Bochner theorem can be used to deduce a 
similar result for any unitary operator The argument of Avigad, Gerhardy and Towsner 
IIAGTIOI Theorem 2.16] gives a similar result for arbitrary contractions on Hilbert 
spaces. An even more precise result regarding contractions on uniformly convex spaces 
has been recently obtained by Avigad and Rute [AR12]. 

Quantitative statements similar to Theorem 13.51 with uniform bounds that do not 
depend on the particular measure preserving system allow us to use a certain induction 
argument that breaks down if this uniformity is disregarded. A decomposition of the 
form ( I3.4D , albeit a much more elaborate one (Structure Theorem |7.2| ), will also play 
a prominent role. 

4. Polynomial mappings 

The definition of pol)momial mappings given in the introduction has a serious dis- 
advantage: unlike in the commutative case, the pol5momials of scalar degree < d need 
not form a group under pointwise multiplication. 

This flaw has been rectified for poljmomial mappings from groups into nilpotent 
groups by Leibman who introduced the notion of vector degree and showed that poly- 
nomial mappings of given vector degree form a group under pointwise operations 
[|Lei02[ Proposition 3.7 and erratum]. In this section we give a short review of this re- 
sult in terms of prefiltrations instead of vector degrees. A prefiltration G, is a sequence 
of subgroups 

(4.1) Go>Gi>G2>... such that [G,, G^] c G,+j for all i,; e N = {0, 1, . . . }. 

A filtration (on a group G) is a prefiltration in which Gq = Gj (and Gq = G). A 
prefiltration is said to have length d e N if G^j+j is the trivial group and length —00 
if Gq is the trivial group. In this article we only consider prefiltrations for which one 
of these alternatives holds. If G is a nilpotent group then the lower central series is 
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a filtration. For a prefiltration G. of length d we define G.+t to be the prefiltration of 
length d — t i= — oo if d < t) on G given by (G.+J^ = G,.,_f . 

Recall that the discrete derivative of a map g : F ^ G is defined by 

Da,b8W = 8~\n)T^ i,gin), where r^,bg(n) = g(,anb). 

We define G. -polynomial maps by induction on the length of the prefiltration. 

Definition 4.2. Let G. be a prefiltration of length d e {-00} u N. A map g: F Gq 
is called G.-polynomial if either d = —00 (so that g = Ig) or for every a, b e F the 
discrete derivative f,g is G.+j-polynomial. 

In other words, a map g : F ^ G is polynomial if every discrete derivative is polyno- 
mial "of lower degree" (in fact we prefer not to define a "degree" for our polynomials 
since it is necessary to keep track of the prefiltration G. anyway). 

Recall that in the introduction we have defined a map g : F ^ G into a nilpotent 
group to be pol5momial of scalar degree < d if and only if 

Da,M ■ ■ ■ ^a.^ubi+iS = 1g for any a^,..., a^+i, bi, . . . , bd+i ^ F. 

This is easily seen to be equivalent with the present definition. Indeed, a polynomial 
mapping of scalar degree < d is polynomial with respect to the filtration 

(4.3) Go > Gi > • • ■ > Gi > • • • > G, > • • • > G, > G,+i 

d times d times 

(that refines the lower central series Gq = > ■ ■ ■ > G^> G^+j of G) since the (d -I- 1)- 
th derivative of g vanishes identically and in particular takes values in Gj (note that 
one cannot a priori say anything about the d-th derivative). On the other hand a 
G.-polynomial mapping has scalar degree < d. 

Thus there is no ambiguity in calling a mapping polynomial. Moreover this shows 
that for any finite set of polynomial maps into a nilpotent group there is a filtration 
with respect to which they are all polynomial. 

The next theorem is a version of Leibman's result that polynomial mappings form a 
group. It is a special case of results in IIZK12II with 

(4.4) ^ = {Tj, b : F ^ F,n anb, where a,b& F}. 

Theorem 4.5. The set of G.-polynomial maps F ^ G is a group under pointwise oper- 
ations and is shift-invariant in the sense that for every G.-polynomial g and a, b e F 
the translate T^ i,g is also G.-polynomial. Moreover for any G.^^.-polynomial maps 
gi'. r —> G, i = 0, 1, the commutator [go,gi] is G.^^^^^^-polynomial. 

Note that the second claim follows from the identity Tg = gDjg. 

Remark 4.6. The polynomial maps of finite scalar degree fail to form a group if G is 
replaced by the dihedral group D3 that is the smallest non-nilpotent group. Indeed, let 
5 be a rotation and a a reflection in D3, then 5^ = = (cr5)^ = 1. The sequences 
(. . . , cr, 1, CT, 1, . . . ) and (. . . , a5, 1, a5, 1, . . . ) are polynomial (of scalar degree < 1) but 
their pointwise product (. . . , 5, 1, 5, 1, . . . ) is not. 

If F = Z'' or F = R'' then examples of polynomial mappings are readily obtained 
considering g(n) = r^i'-"-' j-piM^ where p,- : Z'' ^ Z (resp. K'' R) are conven- 
tional polynomials and -.Z^ G (resp. M ^ G) are one-parameter subgroups. 

For noncommutative groups F it is not evident that there exist any non-trivial poly- 
nomial functions g : F ^ G to some nilpotent group. However group homomorphisms 
are always polynomial, see IIZK12II . 



NILPOTENT ERGODIC AVERAGES ALONG F0LNER NETS 



7 



5. Complexity 

In this section we give a steamHned treatment of the notion of complexity due to 
Walsh [|Wall2[ Section 4] . It serves as the induction parameter in the proof of our 
main theorem. 

An ordered tuple g = (go, •••,§;) of measurable mappings from F to G in which 
go = Ig is called a system (it is not strictly necessary to include the constant mapping 
^0 in the definition but it comes in handy in inductive arguments). 

The complexity of the trivial system g = (Ig) is by definition at most zero, in sym- 
bols cplxg < 0. A system has finite complexity if it can be reduced to the trivial system 
in finitely many steps by means of two operations, reduction (used in Proposition 19.41 1 
and cheating (used in Theorem 19.1 I I. 

For a, b e F we define the (a, byreduction of mappings g,h: F ^ G to be the map- 
ping 

{S\h)a,b (") = D,^,ig-'Xn)T,^,Kn) = gMgianbT'Kanb) 
and the (a, b)-reduction of a system g = (go, . . . , gj) to be the system 

Slb ■■= S' W (g; lg')„_^ = (go, ■ • ■ , gj-1, {gjlSo}^^^ , • ■ ■ , {sMj-l)a,b) ' 

where we use the shorthand notation g' = (go, • • • ,gj-i) and (gj|(go,---,gj-i)) = 
((gjigo) (gjlgj-i)) and where the S3mibol "i+i" denotes concatenation. If the re- 
duction g* has complexity at most c — 1 for every a, b e F then the system g is defined 
to have complexity at most c. 

Furthermore, if g is a system of complexity at most c and the system g consists of 
functions of the form gc, where g e g and c e G, then we cheat and set cplxg < c. 
This definition tells that striking out constants and multiple occurrences of the same 
mapping in a system as well as rearranging mappings will not change the complexity 
and adding new mappings can only increase the complexity, for example 

Cplx(lG, g2, giC, gi, C') = Cplx(lG, gi, g2) < CpMlg, gi, g2, gs)- 

Note that cheating is transitive in the sense that if one can go from system g to system 
g in finitely many cheating steps then one can also go from g to g in one cheating step. 

In general a system need not have finite complexity. We record here a streamlined 
proof of Walsh's result that that every pol)momial system does have finite complexity. 
We say that a system (go,---,gj) is G,-polynomial for a prefiltration G. if each g, is 
G, -polynomial in the sense of Definition |4.2| 

Theorem 5.1. The complexity of every G,-polynomial system g = (gg, ... ,gj) is bounded 
by a constant c(d, j) that only depends on the length d of the prefiltration G, and the size 
j of the system. 

The proof is by induction on d. For induction purposes we need the formally 
stronger statement below. We use the convenient shorthand notation g(ho, . . • ,bj.) = 
(gho, • • • , gfi J- 

Proposition 5.2. Let g = (gg, . . . , gj) be a G,-polynomial system. Let also hg, . . . , hj be 
G,^i-polynomial systems and assume cplxh^ < Cj. Then the complexity of the system 
g = ggho W • • • W gjhj is bounded by a constant c' = c'{d,j, |hol, . . . , |hj_i|, c^), where d is 
the length of G,. 

The induction scheme is as follows. Theorem lS.ll with length d — 1 is used to prove 
Proposition IS . 2 I with length d that in turn immediately implies Theorem lS.ll with length 
d. The base case, namely Theorem lS.ll with d = — oo, is trivial and c(— oo, j) = 0. 
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Proof of Prov. \5.2\ assuminz Thm. \5.1\f or lensth d — 1. It suffices to obtain a uniform 
bound on the complexity of g* for every reduction g* = g* possibly cheating first. 
Splitting hj = h'j i+i (h) (where h^. might be empty) we obtain 

(5.3) g* = goho W • • • W gj_ihj_i w gjh; U {gjMgoho w • • • W gj_ihj_i W g^h^) . 
Note that for every G.+j -polynomial h' we have 

(5.4) {gjh\g^h') = g^KTgjThT^TgjTh! = g^KThT^Th' = g^ {h\h') 
and 

(5.5) {g^h\g,h') = D{h-^g-')TgJh' = D^h-^g-')g,DgJh' 

= gMh-'g-'mh-'g-'lg,]Dg,Th' = g~h 

where h is a G.+i -polynomial by Theorem |4.5[ By cheating we can rearrange the terms 
on the right-hand side of (15. 3D obtaining 

(5.6) cpbcg* < cplx (goho W • • • W gj_ih,_i W g^ (h^ W (^1^;))) 

for some G,+i-pol5momial systems ho,...,hj_i with cardinality 2|ho|,...,2|hj_;^|, re- 
spectively. 

We use nested induction on j and Cj . In the base case j = we have g = ho and we 
obtain the conclusion with 

c'(d,0,Co) = Co. 

Suppose that j > and the conclusion holds for j — I. If = then by cheating we 
may assume hj = {1q). Moreover (15.61) becomes 

cpkg* < cplx (goho tt) • • • i±i g;-ih;-i) . 

The induction hypothesis on j and Theorem 15.11 applied to hj_i yield the conclusion 
with the bound 

c'(d,;,|hol,...,|h,_i|,0) = cXdJ - l,2|ho|,...,2|hj_2|,c(d - l,2|hj_i|)) + 1. 

If Cj > then by cheating we may assume h_, 7^ (Ig) and cplxh* < — 1, and (15.61 ) 
becomes 

cplxg* < cplx (goho W • • • W g;-ih;-i W gjh*] . 

The induction hypothesis on Cj now yields the conclusion with the bound 

c'idj, |hol, . . . , |hj_i|, c,) = c'idj, 2|ho|, . . . , 2\h^_,\, Cj - 1) + 1. □ 

Proof of Thm. 15. 1 \ assumins Prop, \5.2\f or lensth d. Use Proposition 15.21 with system g 
as in the hypothesis and systems hg, . . . ,hj being the trivial system (1g)- This yields 
the bound 

c(d,j) = c'(d,j,l,...,l,0). □ 

6. F0LNER SEQUENCES 

In this section we introduce the tools needed for induction along a F0lner sequence. 
From the several possible versions of the F0lner condition we pick the weakest that 
suffices for our purposes. Recall that F is a locally compact amenable group and | • | is 
a left Haar measure on F. 

Definition 6.1. A sequence (Fjv)jv of nonempty compact subsets of F is called a F0lner 
sequence if for every compact set K (ZT one has 

sup\lFi^AFt^\/\Ffj\^0 as Af ^ 00. 
/eJC 
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Every summing sequence [IPat88', Definition 4.15] is in particular F0lner in this sense 
and every cj-compact amenable group admits a summing sequence IIPat88l Theorem 
4.16]. If the group is not cj-compact one has to use nets instead of sequences. 

Note also that (F7v^'n)n is a F0lner sequence for any shifts (b^v)^ T whenever 
(Fjv)Ar is F0lner Thus we could replace the sets Qjv^n Theorem 1 1.1 1 bv sets a^Fj^^iV) 
but no additional generality would be gained by doing so. It will nevertheless be crucial 
to work with estimates that are uniform over such two-sided translates in the proof 

We denote sets of the form aPp^b, a, b e F, JV e N by the letter I and call them 
F0lner sets. For a F0lner set I we write [/] = JV if / = aF^ b for some a, b e F. 

By the F0lner property for every 7 > there exists a function c/?^ : N ^ N such that 

(6.2) sup |ZF^AF^|/|Fw| <r for every JV > (^/L). 

A finite measure set K is said to be y -approximately included in a measurable set /, in 
symbols K <^ I, if \K \I\/\K\ < y. The next lemma states that the family of F0lner sets 
is directed by ^-approximate inclusion. 

Lemma 6.3. For every y > and any compact sets I and I' with positive measure there 
exists a number with the property that for every N > there exists some 

b G F such that / F^b and I' <y Fn^- 

We use the expectation notation E„g;/(n) = for finite measure subsets 

/ c F, where the integral is taken with respect to the left Haar measure. Note that the 
expectation satisfies ^neaibfW = IE„gj/(anb) for any a, b e F. 

Proof. Let K (zThe compact and ^ > to be chosen later By the F0lner property there 
exists a number Nq such that for every Af > JVg we have |/Fjv n Fjv |/|F;^ | > 1 — /3 for all 
I e K. Integrating over K and using Fubini's theorem we obtain 

= Ei^pjKnFf,b-'\/\K\. 

Therefore there exists a b e F (that may depend onN >Nq) such that \KnF^b\/\K\ > 
I -P, so \K\Ff,b\/\K\< 13. 

We apply this with K = I U I' and 6 = yS!S!Mpl_ Let llj'ly := Afn as above and 

\K\ i 

N > Then for an appropriate b e F we have 

|/\F^b|/|7|<|K\F„b|/|7|<|K|^/|7|<r, 
and analogously for I'. □ 

7. The structure theorem 

The idea to prove a structure theorem for elements of a Hilbert space via the Hahn- 
Banach theorem is due to Gowers IIGowlOl Proposition 3.7]. The insight of Walsh 
[| Wall 21 Proposition 2.3] was to allow the "structured" and the "pseudorandom" part 
in the decomposition to take values in varying spaces that satisfy a monotonicity con- 
dition. 

His assumption that these spaces are described by norms that are equivalent to the 
original Hilbert space norm can be removed. In fact the structure theorem continues 
to hold for spaces described by extended seminorm^ that are easier to construct in 
practice as we will see in Lemma [831 

The Hahn-Banach theorem is used in the following form. 



An extended seminorm \\ ■ || on a vector space H is a function with extended real values [0, -l-oo] that is 
subadditive, homogeneous (i.e. ||Au|| = if A # 0) and takes the value at 0. 
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Lemma 7.1. Let V^, i = l,...,k, be convex subsets of a Hilbert space H at least one of 

which is open and each of which contains 0. Let V := CiVj H + C]^Vk with C; > and 

take f ^V. Then there exists a vector cf) such that {f,4>)>l and {v,(f)) < c~^ for 
every v e V; and every i. 

Proof By the assumption the set V is open, convex and does not contain / . By the 
Hahn-Banach theorem there exists a </) e H such that {f,4>) > 1 and (v, < 1 for 
every v e V. The claim follows. □ 

The next result somewhat resembles Tao's structure theorem [|Tao06ll , though Tao's 
result gives additional information (positivity and boundedness of the structured part). 
Gowers IIGowlOII described tricks that allow to extract this kind of information from a 
proof via the Hahn-Banach theorem. 

Structure Theorem 7.2. For every 5 > 0, any functions co,ip : N —> 'N and every M. e N 
there exists an increasing sequence of natural numbers 

(7.3) M. < Ml < • • • < Mp25-2i 

for which the following holds. Let 17: IR+ IR+ be any function and (|| • ||n)n6N be a 
sequence of extended seminorms on a Hilbert space H such that the sequence of dual 
extended seminorms (|| • ||JJ,)jv€N decreases monotonically. Then for every f ^ H with 
11/11 < 1 there exists a decomposition 

(7.4) f=a+u + v 
and anl<i < \25~^] such that 

(7.5) Walls <cf'\ < rjCcf'") and ||v|| < 5, 

where the integers A and B satisfy co(A) < M; and i/'CM;) < B and where the constant cf'^ 
belongs to a decreasing sequence that only depends on 5 and 17 and is defined inductively 
starting with 

(7.6) Cg_, = l by c,^-max{cf--^}. 

In the sequel we will only use Theorem 17.21 with the function coiA) = A, in which 
case we can choose A = M;, and with 5 and 17 as in ( 18. 3D . 

Proof It suffices to consider functions such that w(JV) > N and ip(N) > N for all N (in 
typical applications xp grows rapidly). 

The sequence (MJ and auxiliary sequences (A^), (B;) are defined inductively start- 
ing with Aj = M. by 

Mi = w(A,), B,=xp(iMi), A,+i=B„ 

so that all three sequences increase monotonically. Let r be chosen later and assume 
that there is no i e { 1 , . . . , r } for which a decomposition of the form ( I7.4D with A = A; , 
B = B, exists. 

For every ie{l,...,r}we apply Lemma [TTT] with Vj, Vj, ^"3 being the open unit balls 
of II ■ IIb ' II ■ IIX ^'^'^ II ■ II' respectively, and with Cj = Q, = rjCQ), C3 = 5. Note that 
V3 is open in H. We obtain vectors 4>i^H such that 

(,^„/)>i, ii<^,ii;^ <(c,r\ 110,111: <r}(c,r\ m\<5-\ 

Take i < j, then B, < A^ and by ( I7.6D we have 



(</'„<^;)i<ii<^,ii;.ii</';iii;<ii</',ii;.ii<^;ii:* 
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SO that 

2 

r^<{<t>, + --- + 4>„ff< + . . . + <^ J|2 < r5-2 + 
which is a contradiction if r > 25~^. □ 



8. Reducible and structured functions 

In this section we adapt Walsh's notion of a structured function and a corresponding 
inverse theorem to our context. Informally, a function is reducible with respect to a 
system if its shifts can be approximated by shifts arising from reductions of this system, 
uniformly over F0lner sets that are not too large. A function is structured if it is a linear 
combination of reducible functions. 



Definition 8.1. Let g = igo,---,gj) be a system, y > and JV e N. A function a 
bounded by one is called uniformly [g,Y,N)-reducible (in S3mibols a e ^g,Y,N^ 
every F0lner set / with 'fji[I\)<N there exist functions bQ,...,bj_i bounded by one, 
an arbitrary finite measure set J c F and some a e F such that for every / e I 



(8.2) 



j-i 



i=0 



<r- 



Walsh's definition of L-reducibility with parameter e corresponds to uniform (g,7,Af)- 
reducibility with N = i^y^L) and a certain j = 7(e) that will now be defined along with 
other parameters used in the proof of the main result. 

Given e > we fix 



e2 



(8.3) 5 = and iqix) -- ^3 

and define the decreasing sequence C^'^ > ■ ■ ■ > ^^25-^] ( 17. 6D . It is in turn used 



2^ ■ 3^x 

to define the function 



(8.4) y = yi(e)=— — , where C* = cf", 

and its iterates ^'^"''^(e) = ^'^(7). 

The ergodic average corresponding to a system g = (go, . . . , gj), bounded functions 
/o, . . . ,fj and a finite measure set / c F is denoted by 

; 

i=0 

and the difference between two such averages by 

<r Lfo, •■•,/;]= <[/0, ■ • ■ ,fj] - <[/0, •■•,/;]■ 

The inverse theorem below tells that any function that gives rise to a large ergodic 
average correlates with a reducible function. 

Inverse Theorem 8.5. Let e > 0. Suppose that \\u\\^ < 3C, the functions /o, 

are bounded by one and ||j^/[/o, . . ■ II2 > e 16 for some F0lner set I = aPf^b. 

Then there exists a uniformly {g,Y,N)-reducible function a such that {u,a) > 217(0). 
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Proof. Set bo := j//[/o, • ■ • u]/o/l|u|L, so that ||bolloo < 1, and := . . . , b^_^ := 
fj-i- Recall go = 1g and note that 

2r)(C)<||u||-i|K[/o,...,/,_i,u]||J 



= ( Kei 1 1 • gjMu, — 

\ ,=0 \W\oo 



We claim that cr is uniformly (g, 7, JV) -reducible. 

Consider a F0lner set aF^b with i/'yCL) < N. We have to show II8.2D for some J c F 
and every l^F^. By definition (16.21 ) of ^p^ we obtain 



f4 -1 I'^'nAFni 

o--E^€F«l lgj(a?mb) ^g,(aZmb)b; - 



i=0 



< 



Since gj^alb) is an isometric algebra homomorphism we see that gj[alb)a is uniformly 
approximated by 

i-i 

EmeF,Y\sM^~^^sM^'^^^~'sMmb)b,. 

i=0 

Splitting almb = ad~^ ■ alb ■ b~^mb we can write this function as 

i-i _ j-i 

^meF,Y\{Sj'S.)aa-\h-^mb ^) b, = E^^j,-!^^ , ]^ ( g,-, g, ) (fli b) b„ 

i=0 i=0 

which gives (I8.2D with J = b~^F,^b for the F0lner set / = aF^b. □ 

Structure will be measured by extended seminorms associated to sets S of reducible 
functions by the following easy lemma. 

Lemma 8.6 (cf CGowlOl Corollary 3.5]). LetH be an inner product space and E c H. 
Then the formula 

k-l k-1 

(8.7) ll/lls := inf {Xi l^tl : / = ^ 

t=o t=o 

where empty sums are allowed and the infimum of an empty set is by convention +00, 
defines an extended seminorm on H whose dual extended seminorm is given by 

(8.8) 11/11*:= sup |(/,0)|=sup|(/,a)|. 

i^eH:||(^||j<l o-€E 

Heuristically a function with small dual seminorm is pseudorandom since it does 
not correlate much with structured functions. 



9. Metastability of averages for finite complexity systems 

We come to the proof of the main result. Instead of Theorem 1 1 . 1 1 we consider the 
following quantitative statement that is strictly stronger in the same way as the quan- 
titative von Neumann Theorem [33] is strictly stronger than the finitary von Neumann 
Theorem [T2I 
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Theorem 9.1. For every complexity c e N and every e > there exists K^ ^ e N such that 
for every function F:N —>-N and every M e N there exists a tuple of natural numbers 

(9.2) M^'''\...,M^'''^ >M 

of size JC^.e that for every system g with complexity at most c and every choice of 
functions /q, ...,/, e L°°[X) bounded by one there exists 1 < i < K^^ such that for all 
F0lner sets 1,1' with M^'^'^ < [1\, [l'\ and \l,l'\c(^^-^ < F{M^'^'^) we have 

(9.3) ll^;',-[/o,-.-,/;]ll2<e. 

Recall that [/,/'] ^c(g) was defined in Lemma [631 Theorem 19.11 will be proved by 
induction on the complexity c. As an intermediate step we need the following. 

Proposition 9.4. For every complexity c e N and every e > there exists K^ ^ e N such 
that for every function f : N ^ N and every M e N there exists a tuple of natural numbers 

(9.5) M^'' Ml'' >M 

of size Kf. ^ as well as a number N = A/^ g jr(M) such that the following holds. For every 
system g such that every reduction g* ^ (a,b & F) has complexity at most c, every choice 
of functions fo,---,fj-i ^ bounded by one and every finite linear combination 

AfCTf of uniformly [g, y , N)-reducible functions there exists 1 <i < K^^ such that 



for all F0lner sets 1,1' with M-'^'^ < [/J, [,1'\ and r/,/'lr=+He) ^ F{M--^'^) we have 



(9.6) 



t ^ t 



The induction procedure is as follows. Theorem 19.11 for complexity c is used to 
deduce Proposition 19.41 for complexity c, which is in turn used to show Theorem 19.11 
for complexity c+ 1. The base case (Theorem 19.11 with c = 0) is trivial, take ^ = 1 
and M°-'-^ = M. 

Proof of Prop. 19.4 1 assuming Thm. \9.1\f or complexity c. The tuple ( I9.5D and the number 
JV will be chosen later For the moment assume that /,/' <^ 1q for some F0lner set /g 
with i4)j{.\.1q\) < N. Consider the functions bg, . . . , bounded by one, the set J'^ dV 
and the element a' e F from the definition of uniform (g, 7,JV)-reducibility of cTj over 
1q (Definition lS.lD . Write 0(x) for an error term bounded by x in L°°(X). By (18.21 ) we 
have 



j-i 



1 

W\ 



I ' Jnel 1=0 
j-1 f ;-l 



/n/o i=o V i=o J ' ' 



The first error term accounts for the L°° error in the definition of uniform reducibility 
and the second fc 
approximated by 



and the second for the fraction of / that is not contained in 1q. This can in turn be 



t4 / \ . I^n/ol |7\/ol 



i=0 i=0 



Ul 
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Using the analogous approximation for /' and summing over t we obtain 

(9.7) Ul,[f„...Jj-i,Y,^,o,]\\^ 

t 

< S WmeJ' IK^'"" [fo, ■ • -Jj-l, K'---' ^-l] II2 + 6r Xi I- 
t t 

If G is commutative and g consists of affine mappings then the maps that constitute 
systems g*, ^ differ at most by constants, and in this case one can bound the first 
summand by a norm of a difference of averages associated to certain functions on 
X X i+if J"^ similarly to the reduction in IITaoOSI Section 5] . In general we need (a version 
of) the more sophisticated argument of Walsh that crucially utilizes the uniformity in 
the induction hypothesis. The argument provides a bound on most (with respect to the 
weights |At|/|J'|) of the norms that occur in the first summand. 

Let r = r(c, e) be chosen later. We use the operation M >-» M^'"^'^' and the constant 
K = K^ y from Theorem 19.11 (with j in place of e) to inductively define functions 
F„...,Fiby 

F, = F, F,_i(M) = max F,(M,'=''''^0. 
Using the same notation define inductively for 1 < ij, . . . , <K the numbers 

The theorem tells that for every t, m and 1 < ii, . . . , is-i < K there exists some 1 < 
is <K such that 

(9.8) [fo, . . . ,fj-i, bl,..., 1I2 < r 

holds provided 

(9.9) Mft. -.''«) = (Mai.....;.-i))<=.r.fs < Ljj^ L/'J and < F,(M^'i--=^5). 

Start with s = 1. By the pigeonhole principle there exists an ij such that l\9.8t holds 
for at least the fraction 1/K of the pairs (t,m) with respect to the weights |Af|/|J^| 
(provided ( I9.9D with s = 1). 

Using the pigeonhole principle repeatedly on the remaining pairs (t, m) with weights 
|At l/lJ*^ I we can find a sequence ij, . . . , such that for all pairs but the fraction i^^Y 
the estimate ( 19. 8D holds provided that the conditions ( 19. 9D are satisfied for all s. 

By definition we have M < M^'i^ < m'-'^-'^'^ < < m'-'^--''^ and 

Fi(m('i^) > F2((m('i')'''''^^) = F2(m('i-'2^) > ■•• > F.CM^'i--''^) = F(m('i'-''^^) 

for any choice of ii,...,ir- Therefore the conditions ( I9.9D become stronger as s in- 
creases. Recall from 08. 4D that 7*^(7) = j'^'^^ie), thus we only need to ensure 

(9.10) Mft. -.',) < L/j, L/'j and [7,7'l^c+i(,) < FCM^'i--''-^). 

This is given by the hypothesis if we define the tuple 09. 5D to consist of all numbers 
M(-h,-Jr) where e {l,...,K}, soK^^ = iK^^yJ. 

We now choose r to be large enough that i^^^Y < T- Then the sum at the right- 
hand side of 09. 7D splits into a main term that can be estimated by 7 ^ J | using 09. 8D 
and an error term that can also be estimated by fUt l-^tl using the trivial bound 

\\-^j;>~[fo,---jj-uK'---'bU^\\2<'^- 

Finally the second condition in ( I9.10D by definition means that there exists a F0lner 
set Iq such that [Iq\ = f(m'^'i--''^) and /,/' <yc+i,^^^ Iq. In particular we have /,/' <^ Iq 
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since y'^^^i^) — T^(e) = j- Taking 

N = max 03^(F(M'^'i--''^)) 

l<ii,--;ir<K ' 

guarantees (/'^(L^oJ) — ^- 1^ 

Proof of Thm. 19. 1 { assumins Prop. \9A\f or complexity c — 1. Let c, e, F and a system g 
with complexity at most c be given. By cheating we may assume that every reduction 
g* J, (a, b e r) has complexity at most c — 1. 

We apply the Structure Theorem |7.2| with the following data. The sequence of 
extended seminorms || ■ ||f^ = II ■ |lEg,,„ is given by Lemma [831 the dual extended 
seminorms || • ||^ = || • ||^ ^ decrease monotonically since Sg,y_N' c Sg_^^w whenever 

JV' > JV. The function ipiM) := iV,. g^(M) is given by Proposition [2]4] with c, e, F as in 
the hypothesis of this theorem. Finally, co(A) = A and M. = M. The structure theorem 
provides a decomposition 

(9.11) fj=J]x,a,+u + v, 

t 

where ^tl^tl < cf'" =: C, < C*, a, e Sg,^^^, ||u||;^^ < rj(C,) and ||v||2 < 5. Here 
''/'(M,) < B and the number M; > M. = M comes from the sequence ( I7.3D that depends 
only on ip, M and e and whose length [25"^] depends only on e. Note that in turn 
depends only on c, e and F. 

We will need an L°° bound on u in order to use the Inverse Theorem 1 8. 5 1 For this 
end let S = {\v\ < CJ cX, then 

|u|ls<ls+2|A,|ls + |v|ls<3C,. 

t 

Moreover the restriction of u to is bounded by 

|u|lsc < IsC +Xi l^tUsC + Mlsc < 3|v|lsc, 
t 

so it can be absorbed in the error term v. It remains to check that Ijiil^H^ is small. 

By Chebyshev's inequality we have CffiiS^) < \\v\\l, so that fiiS^Y^^ < Let now 
CT e Sjvf. be arbitrary and estimate 

I {u1s,(t) \<\{u,a)\ + \ (ulsc,c7) | < ||u||;;,^ + Hul^c ||2||ct1sc II2 

< r}(C,) + 3||v||2/i(S^)i/2 < r}(C,) + 35 ■ 5/C, < 2r](C,). 

Thus (replacing u by ul^ and v by v + ul^c if necessary) we may assume ||u||oo ^ 3C, 
at the cost of having only ||u||]^. < 2i7(Q) and ||v||2 < 45 < e/6. 

Now we estimate the contrilDutions of the individual summands in ( I9.11D to ( I9.3D . 
The bounds 

lK[/o,---,/;-i,v]||2<g and ||<[/o,...,/;-l,v]||2<^ 

are immediate. Proposition |9]4] for complexity c— 1 with M = M; (applicable since the 
functions are uniformly (g,7,')/)(Mi))-reducible) shows that 



provided that the F0lner sets /,/' satisfy 



m-'-'-' < and r7,/'i,c(,) < Fm;-'-''') 
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for some m5 ^''^'^ that belongs to the tuple ( I9.5D given by the same proposition. The 
former condition implies in particular M; < Mf'^'"^'^ < [/J, [/'J, and in this case the 
Inverse Theorem 18 . 5 1 shows that 

lK[/o,---,/;-i,"]ll2<^ and lK[/o,...,/;-i,u]|l2< J, 

since otherwise there exists a uniformly (g, y, M;)-reducible function a such that (u, a) > 
2rj(Q). 

We obtain the conclusion of the theorem with the tuple ( I9.2D being the concatena- 
tion of the tuples ( 19. 5D provided by Proposition 19.41 with M = M, > M for 1 < i < 
[25-2]. particular K^ ^ = [25-21^^ ^-1- □ 

This completes the induction and thus the proof of Proposition |9.4| and Theorem |9.1l 

Corollary 9.12. Let g = [gQ, ... , gj) be a system with finite complexity and /q,... ,fj e 
L°°(X) be bounded fiinctions. Then for every F0lner sequence (fjv)iv T the limit 

(9.13) hm ^/[/o,...,/;] 

UJ^oo 

exists in L^iX) and is independent of the F0lner sequence (fjv)iv- 

Proof We may assume that the functions /g, ...,/, are bounded by one and cplxg < c 
for some c < oo. We use the abbreviations 

g/(m) = goim)fo gjim)f^, gfU) = ^n,eiSf(m) = j^fU^, . . . ,/,]. 

Assume that the functions g/(/) do not converge in L^[X) as [/J oo. Then there 
exists an e > such that for every M e N there exist F0lner sets /, /' such that M < 
and 

iig/a) - g/(nii2 = [^0' ■ • ■ 'fi^ 1I2 > 

If one takes F(M) := [/,/']yC(e) this contradicts Theorem |9.1l Therefore the limit 

g/(r):= hm g/(7) 

UJ^oo 

exists. The uniqueness is clear for limits along Folner sequences since any two such 
sequences are subsequences of some other Folner sequence. The advantage of the 
averaging argument below is that it also works for nets. 

Let (Fjv)n be another F0lner sequence in P. Let e > be given and take so large 
that 

||g/a)-g/(r)||2<e 

whenever [/J >N^. 

Let JV be so large that |nF^AF^| < e|F^| for every n e . Then 

||E„€F„g/(m) - E„g„j.^g/(m)||^ < e 

for every n e Fjy . Writing 0(e) for an error term that is bounded by e in L^[X) we 
obtain 

EmsF^g/(m) = E„gj,^^E„gp^g/(m) 

= IEneF«^E„e„P^g/(m) + 0(e) 
= En€F„,E„€F„g/(nm) + 0(e) 
= ^meF,KeP,Sfinm) + Oie) 

= E^6F,g/(-Fiv,"i) + 0(e) 
= E^eF,g/(n + 0(2e) 
= g/(r) + 0(2e). 
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Since e is arbitrary the averages g/(F^) converge to g/(r) as N ^ oo. □ 

Theorem 11.11 follows immediately from Theorem 15 . 1 1 applied to the polynomial sys- 
tem g = (go, • • • , gj), where go = Ig, and Corollary [£32] 

Appendix A. Right polynomials and commuting group actions 

An inspection reveals that every occurrence of a e F and related objects d, ai,a'^ ,aj^, 
r in this article could be replaced by Ip (in fact we could restrict their values to any 
subgroup of r but we do not use this). This leads to the notion of right translation and 
right derivative of a mapping g: F ^ G that are defined by 

Tbgin) = ginb) and Di,g(n) = g(n)"ig(nb), 

respectively. Right polynomials, right reduction (•,•);, and and right complexity of 
systems are defined similarly to pol5momials, reduction and complexity, respectively, 
with right derivatives in place of derivatives. Right F0lner sets are sets of the form 

With these definitions we obtain an analog of Corollary |9.12| with complexity re- 
placed by right complexity and F0lner sets replaced by right F0lner sets and an analog 
of Theorem 15 . 1 1 with pol)momials replaced by right polynomials. Together they imme- 
diately imply the following analog of Theorem 1 1 . 1 1 for right polynomials. 

Theorem A.l. Let gi,---,gj' F —> G be measurable right polynomial mappings and 
fi,... ,fj e L°°(X) be arbitrary bounded functions. Then for every F0lner net (Fa)aeA ^^i 
F the limit 



(A.2) lim- 



1 



gi(m)/i gj(m)/, 

meF,, 



exists in L^(X) and is independent of the F0lner net (Fa)aeA- 

A second application of the analog of Corollary |9.12| described above deals with 
commuting actions of F without any further assumptions on the group G generated 
by the corresponding unitary operators. Note that a group action T; gives rise to an 
antihomomorphism g^: F G, g,(ni)/ = f ° T;(m), i.e. a mapping such that gi(nb) = 
g[ (b)g, (n) for every n, b e F. 

Proposition A.3. Let gg = 1^ arid gi, . . . , gj : F G be antihomomorphisms that com- 
mute pairwise in the sense that gi(n)gj,(t)) = gkib)gi{n) for every n, b e F provided 
i ^ k. Then the system (gg, gogi, ■ ■ ■ , go ■ ■ ■ Sj) ^'^^ right complexity at most j. 

Proof Every antihomomorphism g; : F ^ G satisfies 

Ddg-'Xn) = g,(n)g,(nb)-i = g,mg,[b)g,{n)r' = g^ib)-' 

and 

TbgiM = g,("b) = gi(b)gi(,n). 
Thus for every i < j we have 

(go • ■ ■ gj, go---g,)^= £'b((go • ■ ■ gjT^)T,,igo . . . g,) 

= (go(b)---g;(b)r'go(b)go---g,('')g= 

= go.-.g.g,+i(br^..g;(br^ 

Since g;+i(b)~^ . . . gj(b)~^ e G is a constant we obtain 

Cplx(go, gogl, • • • , go • • • g;)b = Cplx(go, gogl, • • • , gl • • • gj-l) 

by cheating. We can conclude by induction on j. □ 

Theorem ll.3l follows from Proposition lA.3l and the analog of Corollarv l9.12| for right 
complexity. 
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